A  STABILITY  PROPERTY  OF  CONDITIONAL  EXPECTATIONS


John  M.  Morrison 
Department  of  Mathematics 
University  of  Texas  at  Austin 
Austin,  Texas  78712 


ABSTRACT 

This paper is concerned with approximating a
conditional expectation of a second order random
variable given a random process defined over an
interval by a conditional expectation of the
random variable given distorted values of the
random process at finitely many times. A sufficient
condition which guarantees a good approximation
is presented. Best estimates of more
general fidelity criteria than mean square error
are also considered, and the above situation is
addressed for a wide class of fidelity criteria.

I.  INTRODUCTION 

Throughout this paper let $(\Omega, \mathcal{F}, P)$ be a fixed
probability space and $(M, \rho)$ be a separable metric
space. Suppose $\{X(t) : t \in [0,T]\}$ is a stochastic
process on $(\Omega, \mathcal{F}, P)$ taking values in $M$ that
is continuous in probability and $Y \in L_2(\Omega, \mathcal{F}, P)$. In
many theoretical situations one is interested in
$E(Y \mid X(t) : t \in [0,T])$. This is the optimal
$\sigma(X(t), t \in [0,T])$-measurable mean square estimate of $Y$
given perfect knowledge of the process $\{X(t) : t \in [0,T]\}$
at all times $t \in [0,T]$; that is, it is
the unique solution [4, pp. 43-45] to the problem:

    $\min \{ \|Y - Z\|_{L_2} : Z \in L_2(\Omega, \sigma(X(t) : t \in [0,T]), P) \}$.

However, in many practical situations we are
neither able to observe the process continuously
nor do we have perfect knowledge about the process
when we are able to observe it. Conventional
measuring devices and computers can only
handle finite data sets. Effectively, they
partition $M$ into finitely many disjoint subsets
$E_1, \ldots, E_n$ and register a fixed value $v_k$ of $E_k$ if
they observe $x \in E_k$, $1 \le k \le n$. These devices
are commonly unable to observe the process at
all times $t \in [0,T]$. Our question becomes: How
well can we estimate $E(Y \mid X(t) : t \in [0,T])$ given our
defective knowledge of $\{X(t) : t \in [0,T]\}$ at only
finitely many times $t_1, \ldots, t_n$ belonging to
$[0,T]$?

More  generally,  we  are  tempted  to  ask  this 
question  about  best  estimates  of  more  general 
fidelity  criteria  than  mean  square  error.  In 
this  paper  we  will  address  this  question  for  a 
very  wide  class  of  fidelity  criteria. 

II.  ROUND  OFF  SCHEMES 

Definition: Let $Q : M \to M$ be Borel measurable and
have finite range, say $\{p_1, \ldots, p_n\}$. The map $Q$
is said to be a round off map if $p_k = Q(p_k)$,
$1 \le k \le n$. The set $\{Q^{-1}(p_1), \ldots, Q^{-1}(p_n)\}$ is
called the partition of $M$ defined by $Q$.

Definition: Let $\{Q_n\}_{n=1}^\infty$ be a sequence of round
off maps on $M$. The sequence $\{Q_n\}_{n=1}^\infty$ is called a
round off scheme if

(i) for all $x \in M$, $\lim_{n \to \infty} \operatorname{diam} Q_n^{-1}(Q_n(x)) = 0$

and

(ii) the partition of $M$ defined by $Q_{n+1}$ refines
that defined by $Q_n$, $n \in \mathbb{N}$. Note $\sigma(Q_n) \subset \sigma(Q_{n+1})$.
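
The two conditions above can be illustrated by a minimal sketch, under assumptions that are not in the paper: take $M = [0,1)$ with the usual metric and let $Q_n$ round each point to the midpoint of its dyadic cell of width $2^{-n}$. The names `make_round_off` and `cell_diameter` are hypothetical helpers chosen for this illustration.

```python
import math

def make_round_off(n):
    """Return a candidate round off map Q_n on M = [0, 1).

    Assumption for this sketch: the cells are the dyadic intervals
    [k * 2**-n, (k + 1) * 2**-n) and the registered value is the midpoint.
    """
    width = 2.0 ** -n

    def Q(x):
        k = math.floor(x / width)   # index of the dyadic cell containing x
        return (k + 0.5) * width    # fixed value registered for that cell

    return Q

def cell_diameter(n):
    """Diameter of every cell of Q_n; tends to 0 as n grows (condition (i))."""
    return 2.0 ** -n

Q2, Q3 = make_round_off(2), make_round_off(3)
x = 0.3
print(Q2(x), Q3(x))          # values registered by the coarse and fine maps
print(Q2(Q2(x)) == Q2(x))    # Q fixes each point of its range
print(Q2(Q3(x)) == Q2(x))    # each Q_3 cell lies inside one Q_2 cell (condition (ii))
```

Halving the cell width at each step is only one convenient choice; any nested sequence of finite Borel partitions whose cells shrink around each point would serve equally well.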

The action of these maps suggests a sequence
of increasingly accurate measuring devices. We
will show that, asymptotically, these
distinguish Borel sets in $M$ via the following lemma.

Lemma 1: $\bigvee_{n=1}^\infty \sigma(Q_n) = \mathcal{B}(M)$, the Borel sets in $M$.

Proof: $\subset$: Obvious, since we require each $Q_n$ to
be Borel measurable.

$\supset$: Choose any open $U \subset M$. Pick $x \in U$;
$\lim_{n \to \infty} \operatorname{diam} Q_n^{-1}(Q_n(x)) = 0$ so there is $n \in \mathbb{N}$ s.t.
$Q_n^{-1}(Q_n(x)) \subset U$. Thus $U$ may be written as a union
of point inverses of the $Q_n$. Since there are
only countably many of these, the union is
countable so $U \in \bigvee_{n=1}^\infty \sigma(Q_n)$. Since $\bigvee_{n=1}^\infty \sigma(Q_n)$ is a
$\sigma$-algebra on $M$ containing every open subset of
$M$, we conclude $\mathcal{B}(M) \subset \bigvee_{n=1}^\infty \sigma(Q_n)$. QED

Lemma 2: Let $X : \Omega \to M$ be Borel measurable. Then

    $\sigma(X) = \bigvee_{n=1}^\infty \sigma(Q_n(X))$.

Proof: This is an easy application of the "good
sets" principle described in [3, p.5].

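Theorem 3: Let $X : \Omega \to M$ be Borel measurable,
$\{Q_n\}_{n=1}^\infty$ be a round off scheme on $M$, and
$Y \in L_2(\Omega, \mathcal{F}, P)$. Then
$E(Y \mid Q_n(X)) \to E(Y \mid X)$ in $L_2$ and a.s.

Proof: [3, p. 301] demonstrates that if $\{\mathcal{F}_n\}_{n=1}^\infty$
is an increasing collection of $\sigma$-algebras on $\Omega$
contained in $\mathcal{F}$ and $\mathcal{F}_\infty = \bigvee_{n=1}^\infty \mathcal{F}_n$, then
$E(Y \mid \mathcal{F}_n) \to E(Y \mid \mathcal{F}_\infty)$ in $L_2$ and a.s.
Taking $\mathcal{F}_n = \sigma(Q_n(X))$, Lemma 2 gives $\mathcal{F}_\infty = \sigma(X)$.

Martingale convergence theorems allow us to
asymptotically reconstruct $E(Y \mid X)$ from
$E(Y \mid Q_n(X))$; see, for instance, [7, Chap. 7].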
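
A minimal numerical sketch of Theorem 3 follows, under assumptions that are not in the paper: $\Omega$ is a finite grid of $N$ equally likely points, $X$ is the grid coordinate in $[0,1)$, $Y = X^2$ (so that $E(Y \mid X) = Y$), and $Q_n$ rounds to dyadic cells of width $2^{-n}$. On such a space $E(Y \mid Q_n(X))$ is simply the average of $Y$ over each cell. The function names are hypothetical.

```python
def cond_exp_given_round_off(y, n):
    """E(Y | Q_n(X)) on a uniform grid of len(y) points in [0, 1).

    Assumption: len(y) is a multiple of 2**n, so each dyadic cell of
    width 2**-n contains the same number of equally likely grid points.
    """
    cells = 2 ** n
    size = len(y) // cells
    out = []
    for c in range(cells):
        block = y[c * size:(c + 1) * size]
        mean = sum(block) / size          # average of Y over the cell
        out.extend([mean] * size)         # constant on the cell
    return out

def l2_error(u, v):
    """L_2 distance between two random variables on the uniform grid."""
    return (sum((a - b) ** 2 for a, b in zip(u, v)) / len(u)) ** 0.5

N = 2 ** 10
xs = [(i + 0.5) / N for i in range(N)]
ys = [x * x for x in xs]                  # Y = X^2, hence E(Y | X) = Y
errors = [l2_error(cond_exp_given_round_off(ys, n), ys) for n in range(11)]
for n, e in enumerate(errors):
    print(n, e)                           # error shrinks as the cells refine
```

The errors are nonincreasing in $n$ because the cell partitions are nested, which is the monotonicity that the martingale argument exploits.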

III.  THE  $L_2$  CASE

Notation: Henceforth for convenience we will
assume $\mathcal{F}$ is complete. If $\mathcal{G} \subset \mathcal{F}$ is a $\sigma$-algebra,
we denote its $P$-completion by $\overline{\mathcal{G}}$.

First  we  dispose  of  a  technicality. 

Lemma 4: Let $\{X(t) : t \in [0,T]\}$ be a process on
$(\Omega, \mathcal{F}, P)$ continuous in probability and $D \subset [0,T]$
be dense. Then

    $\overline{\sigma}(X(t) : t \in [0,T]) = \overline{\sigma}(X(t) : t \in D)$.

Proof: $\supset$: Obvious.

$\subset$: Fix an open $U \subset M$, set $U_m = \{x \in M : \rho(x, U^c) > 1/m\}$,
and let $\overline{U}_m$ denote the topological closure
of $U_m$. It is easy to see that $\{U_m\}_{m=1}^\infty$ is a
nondecreasing collection of open subsets of $M$ and
$U = \bigcup_{m=1}^\infty U_m$. Pick $t \in [0,T]$. Since $\{X(t), t \in [0,T]\}$
is continuous in probability, there exists a
sequence $\{t_n\}_{n=1}^\infty$ in $D$ such that $t_n \to t$ and
$X(t_n) \to X(t)$ a.s. Pick $\omega \in X(t)^{-1}(U)$; there exists $m \in \mathbb{N}$
such that $X(t)(\omega) \in U_m$. Suppose $\lim_{n \to \infty} X(t_n)(\omega) = X(t)(\omega)$;
then $U_m$ is a neighborhood of $X(t)(\omega)$ so
that there exists $N \in \mathbb{N}$ such that for all $n \ge N$,
$X(t_n)(\omega) \in U_m$. Thus

    $\omega \in \bigcup_{m=1}^\infty \liminf_{n \to \infty} X(t_n)^{-1}(U_m)$,

which we will define as $A$. Conversely, suppose $\omega \in A$
and that $\lim_{n \to \infty} X(t_n)(\omega) = X(t)(\omega)$. Then there exists
$m \in \mathbb{N}$ such that $\omega \in \liminf_{n \to \infty} X(t_n)^{-1}(U_m)$
and $X(t)(\omega) \in \overline{U}_m \subset U$. We have
just shown

    $A \,\triangle\, X(t)^{-1}(U) \subset \Omega - \{\omega \in \Omega : \lim_{n \to \infty} X(t_n)(\omega) = X(t)(\omega)\}$.

Since $X(t_n) \to X(t)$ a.s., we see that
$A \,\triangle\, X(t)^{-1}(U)$ has zero probability; and since
$A \in \sigma(X(t), t \in D)$, we see that $X(t)^{-1}(U) \in \overline{\sigma}(X(t), t \in D)$.
Thus for any Borel $B \subset M$, $X(t)^{-1}(B) \in \overline{\sigma}(X(t) : t \in D)$.
It follows $\overline{\sigma}(X(t) : t \in [0,T]) \subset \overline{\sigma}(X(t) : t \in D)$. QED

Definition: A partition $P$ of the closed interval
$[0,T]$ is a finite point set $\{0 = t_0 < t_1 < \cdots < t_n = T\}$.
The mesh of $P$ is defined by
$\mu(P) = \max \{t_k - t_{k-1} : 1 \le k \le n\}$.
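
As a small illustration of this definition, the sketch below builds dyadic partitions of $[0,T]$ and computes their mesh. The dyadic choice and the helper names `dyadic_partition` and `mesh` are assumptions made for this example only; the paper allows any increasing sequence of partitions whose mesh tends to zero.

```python
def mesh(partition):
    """Largest gap between consecutive points of a sorted partition."""
    return max(b - a for a, b in zip(partition, partition[1:]))

def dyadic_partition(m, T=1.0):
    """Partition {0 = t_0 < ... < t_{2^m} = T} with equal gaps T / 2**m."""
    return [T * k / 2 ** m for k in range(2 ** m + 1)]

P2, P3 = dyadic_partition(2), dyadic_partition(3)
print(mesh(P2), mesh(P3))     # the mesh halves at each step
print(set(P2) <= set(P3))     # the sequence of partitions is increasing
```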

Lemma 5: Let $Y \in L_2(\Omega, \mathcal{F}, P)$ and $\{P_m\}_{m=1}^\infty$ be an
increasing sequence of partitions of $[0,T]$ with
$\mu(P_m) \to 0$. If $\{X(t), t \in [0,T]\}$ is a process on
$(\Omega, \mathcal{F}, P)$ continuous in probability, then
$E(Y \mid X(t) : t \in P_m) \to E(Y \mid X(t) : t \in [0,T])$ in $L_2$ and a.s.

Proof: Set $D = \bigcup_{m=1}^\infty P_m$; $\mu(P_m) \to 0$ so $D$ is dense in
$[0,T]$. Thus $E(Y \mid X(t) : t \in P_m) \to E(Y \mid X(t) : t \in D)$ in $L_2$ and a.s.
Lemma 4 implies $E(Y \mid X(t) : t \in D) = E(Y \mid X(t) : t \in [0,T])$
a.s.

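Theorem 6: Let Lemma 5 set notation and $\{Q_n\}_{n=1}^\infty$
be a round off scheme on $M$. Then

    $\lim_{m,n \to \infty} E(Y \mid Q_n(X(t)) : t \in P_m) = E(Y \mid X(t) : t \in [0,T])$

in $L_2$.

Proof: For $m, n \in \mathbb{N}$ put

    $a_{mn} = \| E(Y \mid Q_n(X(t)) : t \in P_m) - E(Y \mid X(t) : t \in [0,T]) \|_{L_2}$

and

    $\mathcal{F}_{mn} = \sigma(Q_n(X(t)) : t \in P_m)$.

Then it is clear $\mathcal{F}_{mn} \subset \mathcal{F}_{m+1,n}$ and $\mathcal{F}_{mn} \subset \mathcal{F}_{m,n+1}$,
$m, n \in \mathbb{N}$. For each $m$ we have

    $E(Y \mid Q_n(X(t)) : t \in P_m) \to E(Y \mid X(t) : t \in P_m)$

as $n \to \infty$. Now letting $m \to \infty$ and applying Lemma 4,
$\lim_{m \to \infty} \lim_{n \to \infty} a_{mn} = 0$. For each $n \in \mathbb{N}$,

    $E(Y \mid Q_n(X(t)) : t \in P_m) \to E(Y \mid Q_n(X(t)) : t \in [0,T])$

in $L_2$ and a.s. as $m \to \infty$. An easy extension of Theorem 3 shows

    $E(Y \mid Q_n(X(t)) : t \in [0,T]) \to E(Y \mid X(t) : t \in [0,T])$.

Thus $\lim_{n \to \infty} \lim_{m \to \infty} a_{mn} = 0$.

Now turn to the $L_2$ minimization property of
the conditional expectation operator to see that
$a_{m+1,n} \le a_{mn}$ and $a_{m,n+1} \le a_{mn}$, $m, n \in \mathbb{N}$. It
immediately follows $\lim_{m,n \to \infty} a_{mn} = 0$.

QED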

IV.  AN  ABSTRACT  PRINCIPLE  OF  BANACH  SPACES 
Definition: A Banach space $B$ is uniformly
convex if for all $\varepsilon > 0$ there exists $\delta > 0$ s.t.
for all $x, y \in B$ with $\|x\| = \|y\| = 1$, $\|x - y\| > \varepsilon$
implies $\|x + y\| \le 2(1 - \delta)$. A Banach space $B$ is
locally uniformly convex if for any sequences
$\{x_n\}_{n=1}^\infty$ and $\{y_n\}_{n=1}^\infty$ with $\|x_n\| = \|y_n\| = 1$,
$n \in \mathbb{N}$, $\|x_n + y_n\| \to 2$ implies $\|x_n - y_n\| \to 0$. A
Banach space $B$ is strictly convex if each point
of the unit sphere is an extreme point of the
closed unit ball.

It  is  well  known  that  uniform  convexity 
implies  local  uniform  convexity,  which  in  turn 
implies  strict  convexity. 

We  denote  the  metric  of  B  as  d. 

Theorem 7: Let $B$ be a reflexive Banach space
and $K \subset B$ be closed and convex. Then for any $x \in B$
the set $L = \{y \in K : d(x,y) = d(x,K)\}$ is closed and
nonvoid.

Proof:  See  [12,  sections  38  and  39]. 

Theorem 8: Let $B$ be a reflexive Banach space.
Then $B$ is strictly convex if and only if for all
$x \in B$ and for all closed and convex $K \subset B$ there
exists a unique $y \in K$ such that $d(x,y) = d(x,K)$.

Proof: This is an easy consequence of Theorem 7.

Note that if $B$ is any Banach space so that
for any $x \in B$ and any closed and convex $K \subset B$ there
exists a unique $y \in K$ s.t. $d(x,y) = d(x,K)$, then $B$
is strictly convex and reflexive. For a proof
see [9, p. 161].

Theorem 9: Let $\{K_n\}_{n=1}^\infty$ be an increasing collection
of closed convex subsets of a strictly
convex reflexive Banach space $B$ and let $K_\infty$ be
the norm closure of $\bigcup_n K_n$. Note that $K_\infty$ is
closed and convex. Let $P_n$ denote minimum norm
projection on $K_n$, $n \in \mathbb{N} \cup \{\infty\}$; this is well
defined by Theorems 7 and 8. Then for all $x \in B$
we have

    $\|x - P_\infty(x)\| = \lim_{n \to \infty} \|x - P_n(x)\|$.

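Proof: Fix $x \in B$. By the minimality of the projections
$P_n$, $n \in \mathbb{N} \cup \{\infty\}$, we have
$\|x - P_\infty(x)\| \le \|x - P_{n+1}(x)\| \le \|x - P_n(x)\|$, $n \in \mathbb{N}$. Thus
$\lim_{n \to \infty} \|x - P_n(x)\|$ exists and is not less than
$\|x - P_\infty(x)\|$. Conversely, choose $\varepsilon > 0$. Note that
$P_\infty(x) \in K_\infty$ implies that there exist $n \in \mathbb{N}$ and
$y \in K_n$ with $\|y - P_\infty(x)\| \le \varepsilon$, so that

    $\|x - P_n(x)\| \le \|x - y\| \le \|x - P_\infty(x)\| + \|P_\infty(x) - y\| \le \|x - P_\infty(x)\| + \varepsilon$.

The arbitrariness of $\varepsilon$ implies
that $\lim_{n \to \infty} \|x - P_n(x)\| \le \|x - P_\infty(x)\|$. QED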


Theorem 10: Let $B$ be a reflexive strictly
convex Banach space and let $\{K_n\}_{n=1}^\infty$, $K_\infty$, $\{P_n\}_{n=1}^\infty$,
and $P_\infty$ be as in Theorem 9. Suppose $x, z \in B$ and
$P_n(x) \rightharpoonup z$ weakly. Then $z = P_\infty(x)$.

Proof: Note that $P_n(x) \in K_n \subset K_\infty$, $n \in \mathbb{N}$, and $K_\infty$ is
weakly closed, so $z \in K_\infty$. By the weak lower
semicontinuity of the norm and Theorem 9,
$\|x - z\| \le \liminf_{n \to \infty} \|x - P_n(x)\| = \|x - P_\infty(x)\|$. Thus we
conclude $z = P_\infty(x)$. QED

Theorem 11: Let the previous theorem set
notation. Then $P_n(x) \rightharpoonup P_\infty(x)$ weakly.

Proof: Choose any subsequence $\{P_{n_k}(x)\}$ of
$\{P_n(x)\}_{n=1}^\infty$. By the Šmulian theorem [9, pp. 145-156]
there exists a further subsequence
$\{P_{n_{k(j)}}(x)\}$ of $\{P_{n_k}(x)\}$ and $z \in B$ s.t.
$P_{n_{k(j)}}(x) \rightharpoonup z$ weakly. Theorem 10 implies $z = P_\infty(x)$. We conclude
that $P_n(x) \rightharpoonup P_\infty(x)$ weakly. QED

Proposition 12: Let $B$ be a locally uniformly
convex Banach space and $\{x_n\}_{n=1}^\infty$ be a sequence in
$B$ with $x_n \rightharpoonup x$ weakly and $\|x_n\| \to \|x\|$. Then
$x_n \to x$ in norm.

Proof: See [8, p.233].

Theorem 13: Let $B$ be a locally uniformly convex
Banach space and $\{K_n\}_{n=1}^\infty$, $\{P_n\}_{n=1}^\infty$, $K_\infty$, and $P_\infty$ be
as in Theorem 9. Then for any $x \in B$,
$P_n(x) \to P_\infty(x)$ in norm.

Proof: Recall that local uniform convexity
implies strict convexity, so minimum norm
projections are defined. Pick $x \in B$; $P_n(x) \rightharpoonup P_\infty(x)$ weakly
implies $x - P_n(x) \rightharpoonup x - P_\infty(x)$ weakly. But
$\|x - P_n(x)\| \to \|x - P_\infty(x)\|$, so the theorem follows
from Proposition 12. QED

V.  CASE  OF  $L_\Phi$

The basic facts about Orlicz spaces we use
here may be found in [6] and [10]. Henceforth
we stipulate that $(\Omega, \mathcal{F}, P)$ be nonatomic.


Throughout we will assume that our Young
function $\Phi : [0,\infty) \to [0,\infty)$ has strictly increasing
first derivative on $[0,\infty)$ and that $\Phi$ and its
complementary Young function $\Psi$ satisfy the $\Delta_2$ or
doubling condition. Recall the Luxemburg norm
of $Y \in L_\Phi(\Omega, \mathcal{F}, P)$ is defined by

    $N_\Phi(Y) = \inf \{ k > 0 : \int_\Omega \Phi(|Y|/k) \, dP \le \Phi(1) \}$

and that for a sequence $\{Y_n\}_{n=1}^\infty$ in $L_\Phi$, $N_\Phi(Y_n - Y) \to 0$
if and only if

    $\lim_{n \to \infty} \int_\Omega \Phi(|Y_n - Y|) \, dP = 0$.

Furthermore, this norm makes $L_\Phi$ a reflexive
uniformly convex Banach space. Thus the
machinery of the last section applies. Note
however that in general these minimum norm
projections are nonlinear.

Now let $\mathcal{G}$ be any sub $\sigma$-algebra of $\mathcal{F}$ and
$Y \in L_\Phi(\Omega, \mathcal{F}, P)$. The set $L_\Phi(\Omega, \mathcal{G}, P)$ is a closed
subspace of $L_\Phi(\Omega, \mathcal{F}, P)$ so $Y$ has a unique minimum
norm projection into $L_\Phi(\Omega, \mathcal{G}, P)$ which we will
denote by $E_\Phi(Y \mid \mathcal{G})$. The primary tool used in
the $L_2$ case was the martingale convergence
theorem; we will obtain an analog of it here.
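
The nonlinearity of these projections can be seen in a minimal sketch, under assumptions that are not part of the paper's setting: a finite sample space with equally likely points (so nonatomicity is dropped), $\Phi(x) = x^p/p$ (for which the Luxemburg norm above reduces to the usual $L_p$ norm), and a sub $\sigma$-algebra generated by a finite partition into cells. In that case $E_\Phi(Y \mid \mathcal{G})$ is constant on each cell and equals the constant minimizing the sum of $|y - c|^p$ over the cell. The helpers `best_constant` and `e_phi`, and the use of ternary search, are illustrative choices.

```python
def best_constant(values, p, iters=200):
    """Constant c minimizing sum(|v - c|**p), found by ternary search.

    The cost is convex in c for p >= 1, so ternary search on
    [min(values), max(values)] converges to a minimizer.
    """
    lo, hi = min(values), max(values)

    def cost(c):
        return sum(abs(v - c) ** p for v in values)

    for _ in range(iters):
        m1 = lo + (hi - lo) / 3
        m2 = hi - (hi - lo) / 3
        if cost(m1) < cost(m2):
            hi = m2          # a minimizer lies in [lo, m2]
        else:
            lo = m1          # a minimizer lies in [m1, hi]
    return (lo + hi) / 2

def e_phi(y, cells, p):
    """Minimum L_p norm projection of y onto functions constant on each cell.

    y is a list of values on equally likely points; cells is a list of
    index lists forming a partition of range(len(y)).
    """
    out = [0.0] * len(y)
    for cell in cells:
        c = best_constant([y[i] for i in cell], p)
        for i in cell:
            out[i] = c
    return out

cells = [[0, 1, 2, 3]]                 # trivial sigma-algebra on four points
y1, y2 = [0, 0, 0, 1], [1, 0, 0, 0]
print(e_phi(y1, cells, 2)[0])          # p = 2 recovers the ordinary mean
print(e_phi(y1, cells, 4)[0])          # p = 4 gives a different constant
s = [a + b for a, b in zip(y1, y2)]
print(e_phi(s, cells, 4)[0], e_phi(y1, cells, 4)[0] + e_phi(y2, cells, 4)[0])
```

For $p = 2$ the minimizer is the cell average, which is the ordinary conditional expectation; for $p = 4$ the projection of a sum differs from the sum of the projections, which is the nonlinearity noted above.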

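Lemma 14: Let $Y \in L_\Phi(\Omega, \mathcal{F}, P)$, $\{\mathcal{F}_n\}_{n=1}^\infty$ be an
increasing collection of sub $\sigma$-algebras of $\mathcal{F}$
and $\mathcal{F}_\infty = \bigvee_{n=1}^\infty \mathcal{F}_n$. Then the norm closure of
$\bigcup_{n=1}^\infty L_\Phi(\Omega, \mathcal{F}_n, P)$ is $L_\Phi(\Omega, \mathcal{F}_\infty, P)$.

Proof: Put $Z_n = E(Y \mid \mathcal{F}_n)$, $n \in \mathbb{N} \cup \{\infty\}$. Repeated
application of Jensen's inequality yields:

    $0 \le \Phi(|Z_n|) = \Phi(|E(Y \mid \mathcal{F}_n)|)$
    $\le \Phi(E(|Y| \mid \mathcal{F}_n))$
    $\le E(\Phi(|Y|) \mid \mathcal{F}_n)$,

dominating $\{\Phi(|Z_n|)\}_{n=1}^\infty$ by a uniformly integrable
sequence of functions. Thus, $\{\Phi(|Z_n|)\}_{n=1}^\infty$ is
uniformly integrable. Now apply convexity and
the doubling condition, yielding

    $\Phi(|Z_n - Z_\infty|) \le \Phi(|Z_n| + |Z_\infty|)$
    $\le \frac{1}{2} \Phi(2|Z_n|) + \frac{1}{2} \Phi(2|Z_\infty|)$
    $\le \frac{c}{2} \Phi(|Z_n|) + \frac{c}{2} \Phi(|Z_\infty|)$,

where $c$ is a constant from the doubling condition,
independent of $n$. It follows now that
$\{\Phi(|Z_n - Z_\infty|)\}_{n=1}^\infty$ is uniformly integrable. Since
$\Phi(|Z_n - Z_\infty|) \to 0$ a.s., $\int \Phi(|Z_n - Z_\infty|) \, dP \to 0$ and
$N_\Phi(Z_n - Z_\infty) \to 0$. The lemma follows immediately.

QED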

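Theorem 15: Let $\{\mathcal{F}_n\}_{n=1}^\infty$ be an increasing
collection of sub $\sigma$-algebras of $\mathcal{F}$ and
$\mathcal{F}_\infty = \bigvee_{n=1}^\infty \mathcal{F}_n$. Then if $Y \in L_\Phi(\Omega, \mathcal{F}, P)$,
$E_\Phi(Y \mid \mathcal{F}_n) \to E_\Phi(Y \mid \mathcal{F}_\infty)$ in $L_\Phi$.

Proof: Apply Lemma 14 and Theorem 13. QED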

Remark:  Consulting  [1],[2],[5],  and  [11]  it  is 
possible  to  see  this  convergence  is  almost  sure. 
Now  we  extend  Lemma  5: 

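Lemma 16: Let $Y \in L_\Phi(\Omega, \mathcal{F}, P)$ and $\{P_m\}_{m=1}^\infty$ be an
increasing sequence of partitions of $[0,T]$ with
$\mu(P_m) \to 0$. If $\{X(t) : t \in [0,T]\}$ is a process on
$(\Omega, \mathcal{F}, P)$ taking values in $M$ that is continuous in
probability then

    $E_\Phi(Y \mid X(t) : t \in P_m) \to E_\Phi(Y \mid X(t) : t \in [0,T])$ a.s.

as $m \to \infty$. Furthermore,

    $\lim_{m \to \infty} \int_\Omega \Phi(|E_\Phi(Y \mid X(t) : t \in P_m) - E_\Phi(Y \mid X(t) : t \in [0,T])|) \, dP = 0$.

Proof: [...]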

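Theorem 17: Let Lemma 16 set notation
and $\{Q_n\}_{n=1}^\infty$ be a round off scheme on $M$. Then

    $E_\Phi(Y \mid Q_n(X(t)) : t \in P_m) \to E_\Phi(Y \mid X(t) : t \in [0,T])$ in $L_\Phi$

and

    $\int_\Omega \Phi(|E_\Phi(Y \mid Q_n(X(t)) : t \in P_m) - E_\Phi(Y \mid X(t) : t \in [0,T])|) \, dP \to 0$

as $m, n \to \infty$.

Proof: For $m, n \in \mathbb{N}$ set $\mathcal{F}_{mn} = \sigma(Q_n(X(t)) : t \in P_m)$.
Choose sequences $\{m_k\}_{k=1}^\infty$ and $\{n_k\}_{k=1}^\infty$ so that
$m_k, n_k \uparrow \infty$; put $\mathcal{G}_k = \mathcal{F}_{m_k n_k}$, $k \in \mathbb{N}$. Then
$\bigvee_k \mathcal{G}_k = \bigvee_{m,n} \mathcal{F}_{mn}$. Apply Theorem 15. QED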

Finally,  for  icing  on  the  cake  we  get  a 
similar  result  for  ordinary  conditional 
expectation: 

Theorem 18: Let Theorem 17 set notation. Then
as $m, n \to \infty$,

    $E(Y \mid Q_n(X(t)) : t \in P_m) \to E(Y \mid X(t) : t \in [0,T])$ in $L_\Phi$

and

    $\int_\Omega \Phi(|E(Y \mid Q_n(X(t)) : t \in P_m) - E(Y \mid X(t) : t \in [0,T])|) \, dP \to 0$.

Proof: Mimic Theorem 17 using the fact that
$E(Y \mid \mathcal{G}_k) \to E(Y \mid X(t) : t \in [0,T])$ in $L_\Phi$,
derived in Theorem 13.

QED

Remark: If $\Phi(x) = x^p/p$, $x \in [0,\infty)$ and $p > 1$, $L_\Phi = L_p$
and the nonatomicity assumption may be dropped.